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<! — StartFragment — > RESULT 3 
AB181388 

LOCUS AB181388 11991 bp DNA linear BCT 05-OCT-2006 

DEFINITION Brevundimonas sp. SD212 carotenoid biosynthesis gene cluster (orfl, 

crtW, crtY, crtl, crtB, orf6, orf7, crtE, idi, crtZ, crtG, orfl2), 

complete cds . 
ACCESSION AB181388 

VERSION AB181388.1 GI:67003493 

KEYWORDS 

SOURCE Brevundimonas sp. SD212 

ORGANISM Brevundimonas sp. SD212 

Bacteria; Proteobacteria; Alphaproteobacteria; Caulobacterales ; 

Caulobacteraceae; Brevundimonas. 
REFERENCE 1 

AUTHORS Nishida,Y., Adachi,K., Kasai,H., Shizuri,Y., Shindo,K., Sawabe,A., 
Komemushi, S . , Miki,W. and Misawa,N. 

TITLE Elucidation of a carotenoid biosynthesis gene cluster encoding a 

novel enzyme, 2, 2 ' -beta-hydroxylase, from Brevundimonas sp. strain 
SD212 and combinatorial biosynthesis of new or rare xanthophylls 

JOURNAL Appl. Environ. Microbiol. 71 (8), 4286-4296 (2005) 
PUBMED 16085816 
REFERENCE 2 (bases 1 to 11991) 

AUTHORS Nishida,Y. and Misawa,N. 

TITLE Direct Submission 

JOURNAL Submitted ( 08-JUN-2004 ) Norihiko Misawa, Marine Biotechnology 
Institute, Applied Science of Molecular Design; 3-75-1, Heita, 
Kamaishi-shi, Iwate 026-0001, Japan 

(E-mail : norihiko .mi sawa@mbio . jp, Tel: 81-193-26-6581, 
Fax:81-193-26-6584) 
FEATURES Location/Qualifiers 
source 1. .11991 

/organism=" Brevundimonas sp. SD212" 
/mol_type="genomic DNA" 
/strain="SD212" 
/db_xref="taxon: 281067" 

/note="Brevundimonas sp. SD212 is distributed from Marine 
Biotechnology Institute as MBIC03018. 
cloning vector: p5Bre2-15" 
CDS complement (1313. .1648) 

/note="orf 1" 
/codon_start=l 
/transl_table=ll 

/product =" hypothetical protein" 
/protein_id="BAD99405. 1" 
/db_xref="GI : 67003494" 

/translat ion= " MPLALLFALAALSSTPAPAADPAKAQPKTSGGIKAAAQSSAPVL 

DDVRVRLRCIGRADGRVSDCWLSESRPGYGFGEAAVALMNGTATPPSRDGGRPIDRP 

FEHTIQFTP" 
gene complement ( 1753 . .2487) 

/gene="crtW" 
CDS complement (1753. .2487) 

/gene="crtW" 

/note="4, 4 ' -beta-oxygenase" 

/codon_start=l 

/transl_table=ll 

/product =" beta-carotene ketolase" 
/protein_id="BAD99406. 1" 
/db_xref="GI : 67003495" 

/ translat ion="MTAAVAEPRI VP RQTWIGLTLAGMIVAGWGSLHVYGVYFHRWGT 
SSLVIVPAIVAVQTWL SVGLFIVAHDAMHGSLAPGRPRLNAAVGRLTLGLYAGFRFDR 
LKTAHHAHHAAPGTADDPDFYAPAPRAFLPWFLNFFRTYFGWREMAVLTALVLIALFG 
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LGARPANLLTFWAAPALLSALQLFTFGTWLPHRHTDQPFADAHHARSSGYGPVLSLLT 

CFHFGRHHEHHLTPWRPWWRLWRGES " 
gene 2567. .3745 

/gene="crtY" 
CDS 2567. .3745 

/gene="crtY" 

/codon_start=l 

/transl_table=ll 

/product="lycopene beta-cyclase" 
/protein_id="BAD99407. 1" 
/db_xref="GI : 67003496" 

/translation "MQAAAPDSSAPDLLLLGGGLANGLLALRLSQVRPELDVRIVEAA 
DRLGGIHTWSFFEADLTPAQRAWIAPLIACRWPGYSVRFPAFERRLVTGYCSVTAERF 
AEAVTQALAGRIVTGAAWSAGPTEAVLADGHRLTARAVIDGRGPTAAPDLALGFQKF 
VGLEVRLTAPHGLKEPIVMDACVDQSGGYRFLYVLPFDDRTLLIEDTRYTDGDDLDHD 
LFRTGVRDYAAQRGWVIETVLREEEGVLPVALDGDIAAHLKRLGPTALSGLRAGLFHP 
TTGYSLPDAVRLADHLAERIEAAPDGPALAQVIRRHARDVWAQRGFYRLLNRMLFRAA 
RPDQRYRVLERFYRLPQPLIERFYAGETTLADKARILSGKPPVPIGAALTCLVERGRA" 

gene 3745. .5226 

/gene="crtl " 

CDS 3745. .5226 

/gene="crtl" 
/codon_start=l 
/transl_table=ll 
/product="phytoene desaturase" 
/protein_id="BAD99408. 1" 
/db_xref="GI : 67003497" 

/ translation="MRAAVIGSGFGGLSLAIRLQTAGIQTTVFEARDLPGGRAYVYKD 
KGYTFDAGPTVITDPSALEELFEGAGRKLSDYVELLPVAPFYRLCWEDGDVFDYVNGQ 
DELDRQIVARNPADKEGYRRFLAYSQDLLKEGYLKLGAVPFLDFASMVKAAPELMRLQ 
AWRSVYDKVAGYIQDEHLRQAFSFHSLLVGGNPFATSSIYALIHALERRWGVWFPRGG 
TGALIQAMVRLFQDLGGEIRLNSPVERITLANGRADGVWGGQALAFDMVASNADVVH 
TYQRLLGQEPRGRKEGARLASKRHSMSLFVIYFGLKRVHPEVRHHTVLFGPRYRELIG 
EIFKGPDLPQDFSLYLHAPTRTDPSLAPEGCDAFYVLAPVPHLASADIDWAVEGPRYR 
DRVLAYLEQHYIPGLTAHLDTCRIFTPVDFRDQLNAHQGSAFSLEPILTQSAYFRVHN 
RDDQIPNLYFVGAGTHPGAGVPGWGSAKATAGLMIEDAGRTA" 

gene 5223. .6155 

/gene="crtB" 

CDS 5223. .6155 

/gene="crtB" 
/codon_start=l 
/transl_table=ll 
/product="phytoene synthase" 
/protein_id="BAD99409. 1" 
/db_xref="GI : 67003498" 

/ trans la tion="MSDAVLDHSRQSMEQGSKSFAAAARLFPAAIRDDAWMFYAWC RH 
CDDE IDGQVLGHGAVGIDPVLAGRKLVELRERTAAALAGEPQTDPVFTAFQRVAARHA 
IPAEEAMDLLQGFEMDVEGRRYDTLEDTLDYAYHVAGVVGVMMARIMGVQDAPTLRRA 
QDLGLAFQLTNIARDWEDAKGGRVYLPGQWLDEAGVPRDQVDQPRHRQAVAHTAQRL 
VAAAEPYYASARWGLRDLNPRSAWAVATARGVYRAIGRHVSRSGATAWDGRTSVDKAG 
KLALVGRGALITLWCKTLDAWREPPPRPALWTHI" 
CDS complement (6162 . .7226) 

/note="orf 6 " 
/codon_start=l 
/transl_table=ll 

/product =" hypothetical protein" 
/protein_id="BAD99410. 1" 
/db_xref="GI : 67003499" 

/ translat ion="MAVAGAWARGFWRDRSGMALTLVLPPLVYLLFAAVFGAGARGDI 
DASVALHDAARTPAAAAVGEAL SRDLGPRL RRL D D AQ A VE RA V I DGRADAGVL ITRDD 
AGAP Q L T WSAAGRDVAAAALQ ARL VPAAAAL AGRPAP P S RD VPVRQ VGPQ GDVQAAY 
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YAGAVSVMFVFFAAMHGAMGGLDERRSGLQARLALARGGTAPILAGRAIWLTTVGLMQ 
SAAVFLAAWPRLPDLAFWQAAAALLTAALVALCAAGVALALTAACRSREQAQPLTTFV 
ALLLAALGGSMAPRFLMPEAFRALGWITPHAWAVETYQALIWRGDFNAGVLAGWAALG 
GVGLAGIALAVMMEHRRAGR" 
CDS complement (7226. .8173) 

/note="orf 7" 
/codon_start=l 
/transl_table=ll 

/product =" hypothetical protein" 
/protein_id="BAD99411. 1" 
/db_xref="GI : 67003500" 

/ translat ion= "MSFI SSGRPNLDASVAPALVLKDVEAGYGEGPVLRDFNLTVAAG 
EVYALLGPNGAGKSTAARVACGLIPVRRGRATAGEGAPRRGRIGLAPQDCALFPALTP 
RENVAAVAALCGLPRSNRPAAVERALALTGCAPRADEPVHSLSGGWRRRANLSAALVG 
RPRLLIADEPTEGVDAAARWLSSALRETVQAGAGCLL I SHDEAFVSETADRIGVLAQ 
GRLLAEGPPKALLRQAFGLARL LWKLPRPASRLAASRFAQSGLAPSKDGLEWRLLCE 
DALGAAQRVSPEADAEGGEVAVRRPGLDDLVAHL SDAS S " 

gene complement ( 8133 . .9029) 

/gene="crtE" 

CDS complement (8133. .9029) 

/gene="crtE" 

/note="geranylgeranyl diphosphate synthase" 

/codon_start=l 

/transl_table=ll 

/product="GGPP synthase" 

/protein_id="BAD99412. 1" 

/db_xref="GI : 67003501" 

/translat ion="MAIVGLRPQPVSDPEPQSPENLRGLVQDRLAETAPSSDGLLALA 
AREALLGPGKRVRPWAMLAAAHVGGRAEDALDFGCAVEMAHAASLVLDDLPCMDDAA 
LRRGQPTLHRRHGEDAAVLAAVALLNQSTRLILQSRAPSEARLGALDDLTQAIGFDGL 
AEGQMRDLRDDPVQRDWALRRINDLKTGALFVAAARGGGRMGGGDADDLARLAAFGE 
AVGFAFQLCDDLMDACSTSEALGKDVGQDQGVTTFVDLWGEGRVRAGVRQSLARAAEA 
VGHDSPLTTYVLHLFRQAELGR" 

gene 9169. .10221 

/gene="idi" 

CDS 9169. .10221 

/gene="idi" 

/note="isopentenyi diphosphate isomerase" 

/codon_start=l 

/transl_table=ll 

/product="IPP isomerase" 

/protein_id="BAD99413. 1" 

/db_xref="GI : 67003502" 

/translat ion="MPTPDDALIRRKDEHIDHVRAGRGLSGASSGLEAVRFVHDALPD 
LALDQIDLSARFLGRRLNLPFLISSMTGGPSRAEAINARLAEAAQALGVALAVGSQRV 
ALETAGGSGGSGLGPDLRRRAPDALILANLGAVQFALGYGVDEARRAMEMIGADALIL 
HLNPLQEGVQPEGDRDWRGVAQGIERIAAAFPGQWVKETGAGLSAAVARRLADMGVA 
ALDVAGAGGTNWGLIEGARATGGRAEALAAPFADWGVPTARSLRDCAQAAPDLGLIGS 
GGIKDGLDAARAVRLGADLVGQAAGVLEAALTSTQAWDHFELMAAQLRLACFCTGSA 
DLAALAQAPLLEEPRF " 

gene complement ( 10188 . .10673) 

/gene="crtZ" 

CDS complement (10188. .10673) 

/gene="crtZ" 

/note="3, 3 ' -beta-hydroxylase" 

/codon_start=l 

/transl_table=ll 

/product =" beta-carotene C3 -hydroxylase " 
/protein_id="BAD99414. 1" 
/db_xref="GI : 67003503" 

/ translat ion="MAWLTWIALFLTAFLGMEAFAWIMHRYVMHGFLWSWHRSHHEPH 
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DHPLEKNDLFAWFAAPAIVMVAVGLHLWPWALPVGLGITAYGMVYFFFHDGLVHRRF 
PTGFSGRSGFWTRRIQAHRLHHAVRTREGCVSFGFLWVRSARALKAELAQKRGSSSSG 
A" 

gene 10748. .11521 

/gene="crtG" 
CDS 10748. .11521 

/gene="crtG" 

/note="2, 2 ' -beta-hydroxylase" 

/codon_start=l 

/transl_table=ll 

/product="carotenoid C2-hydroxylase" 
/protein_id="BAD99415. 1" 
/db_xref="GI : 67003504" 

/translat ion= "MLRDLL I TTLALSLI IGLRYLLVGAAAHGLLWAGAGRGRALNLR 
PPAMKRIRAEIVASLIACPIYALPAALVLELWKRGGTAI YSDPDAWPLWWLPVSLIVY 
LLAH DAFYYWVHRALHHPRVFGWAHAEHHRSRDPSAFASFAFDPAEAAATAWFLPALA 
LIVPIHWGVALTLLTLMSLTAALNHAGREVWPAAWLERAPLRWLITATHHDAHHKRFN 
GNYGLYFQFWDRWAGTEVSAAPSPPSPVIPPERPSAPLR" 
CDS complement (11416 . .11784) 

Query Match 100.0%; Score 774; DB 14; Length 11991; 

Best Local Similarity 100.0%; Pred. No. 2.2e-170; 

Matches 774; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

iy 1 ATGTTGAGGGATCTGCTCATCACCACCCTGGCGCTGAGCCTGATCATCGGCCTGCGCTAT 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
'b 10748 ATGTTGAGGGATCTGCTCATCACCACCCTGGCGCTGAGCCTGATCATCGGCCTGCGCTAT 10807 

y 61 CTGCTGGTCGGCGCGGCGGCCCATGGGCTGCTGTGGGCCGGGGCGGGCCGGGGACGGGCG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 10808 CTGCTGGTCGGCGCGGCGGCCCATGGGCTGCTGTGGGCCGGGGCGGGCCGGGGACGGGCG 10867 

y 121 CTGAACCTGCGGCCGCCGGCGATGAAGCGCATCCGCGCCGAGATCGTCGCCTCCCTGATC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 10868 CTGAACCTGCGGCCGCCGGCGATGAAGCGCATCCGCGCCGAGATCGTCGCCTCCCTGATC 10927 

y 181 GCCTGCCCCATCTACGCCCTGCCGGCGGCCCTGGTGCTGGAGCTGTGGAAGCGGGGCGGG 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 10928 GCCTGCCCCATCTACGCCCTGCCGGCGGCCCTGGTGCTGGAGCTGTGGAAGCGGGGCGGG 10987 

y 241 ACGGCGATCTACAGCGATCCCGACGCCTGGCCCCTGTGGTGGCTGCCGGTCAGTCTGATC 300 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 10988 ACGGCGATCTACAGCGATCCCGACGCCTGGCCCCTGTGGTGGCTGCCGGTCAGTCTGATC 11047 

y 301 GTCTATCTGCTGGCGCACGACGCCTTCTACTACTGGGTGCACAGGGCCCTGCATCACCCG 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 11048 GTCTATCTGCTGGCGCACGACGCCTTCTACTACTGGGTGCACAGGGCCCTGCATCACCCG 11107 

y 361 CGCGTCTTCGGCTGGGCCCATGCCGAACACCACCGGTCGCGCGACCCCAGCGCCTTCGCC 420 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 11108 CGCGTCTTCGGCTGGGCCCATGCCGAACACCACCGGTCGCGCGACCCCAGCGCCTTCGCC 11167 

y 421 TCCTTCGCCTTCGACCCGGCCGAGGCTGCGGCCACCGCCTGGTTCCTGCCCGCCCTGGCC 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
b 11168 TCCTTCGCCTTCGACCCGGCCGAGGCTGCGGCCACCGCCTGGTTCCTGCCCGCCCTGGCC 11227 

>y 481 CTGATCGTGCCGATCCACTGGGGCGTGGCCCTGACCCTGCTGACGCTGATGTCGCTGACG 540 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
b 11228 CTGATCGTGCCGATCCACTGGGGCGTGGCCCTGACCCTGCTGACGCTGATGTCGCTGACG 11287 

y 541 GCCGCCCTGAACCATGCGGGGCGCGAGGTCTGGCCCGCCGCCTGGCTGGAGCGGGCGCCG 600 
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Db 11288 GCCGCCCTGAACCATGCGGGGCGCGAGGTCTGGCCCGCCGCCTGGCTGGAGCGGGCGCCG 11347 

Qy 601 CTTCGCTGGCTGATCACCGCCACCCACCACGACGCCCACCACAAGCGGTTCAACGGAAAC 66 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 11348 CTTCGCTGGCTGATCACCGCCACCCACCACGACGCCCACCACAAGCGGTTCAACGGAAAC 11407 

Qy 661 TACGGCCTCTATTTCCAGTTCTGGGACCGCTGGGCCGGGACTGAGGTTTCGGCCGCCCCC 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 11408 TACGGCCTCTATTTCCAGTTCTGGGACCGCTGGGCCGGGACTGAGGTTTCGGCCGCCCCC 11467 

Qy 721 TCGCCACCATCCCCGGTCATCCCTCCAGAGCGGCCCTCAGCGCCTCTTCGGTGA 774 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 11468 TCGCCACCATCCCCGGTCATCCCTCCAGAGCGGCCCTCAGCGCCTCTTCGGTGA 11521 

< ! — EndFragment — > 
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